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Absti’act 


In  this  paper  we  use  interior-point  methods  for  linear  programming,  developed  in  the 
context  of  sequential  computation,  to  obtain  a  parallel  algorithm  for  the  bipartite  matciiing 
problem.  Our  algorithm  runs  in  0*{y/rn)  time^.  Our  results  extend  to  the  weiglited  bipartite 
matching  problem  and  to  the  zero-one  minimum-cost  flow  problem,  yielding 
algorithms-.  This  improves  previous  bounds  on  these  problems  and  illustrates  the  importance 
of  interior-point  methods  in  the  context  of  parallel  algorithm  design. 


1  Introduction 


In  this  paper  we  use  interior-point  methods  for  linear  programming,  developed  in  the  context 
of  sequential  computation,  to  obtain  a  parallel  algorithm  for  the  bipartite  iiiatchiiig  problem. 
Although  Karp,  Upfal,  and  Wigderson  [20]  have  shown  that  the  bipartite  matching  problem  is 
in  RNC  (see  also  [25]),  this  problem  is  not  known  to  be  in  NC.  Special  cases  of  the  problem  are 
known  to  be  in  NC.  Lev,  Pippenger,  and  Valiant  [23]  gave  an  NC  algorithm  to  find  a  perfect 
matching  in  a  regular  bipartite  graph.  (This  algorithm  is  based  on  a  seciuential  algorithm  of 
Gabow  and  Kariv  [12];  see  also  [5].)  Miller  and  Naor  [24]  gave  an  NC  algorithm  to  find  a  perfect 
matching  in  a  planar  bipartite  graph  (if  one  exists)  and  other  special  cases  are  considered  in  [16]. 

The  previous  best  deterministic  algorithm  for  the  problem,  due  to  Goldberg,  Plotkin,  and 
Vaidya  [13],  runs  in  0*(n^/®)  time.  This  algorithm  is  based  on  combinatorial  algorithms  for  the 
maximum  flow  and  bipartite  matching  problems  [6,  7,  8,  14,  17]  and  on  a  parallel  connectivity 
algorithm  [26].  In  this  paper  we  describe  an  0*{y/m)  algorithm  for  the  bipartite  matching  problem 
that  is  based  on  an  interior-point  algorithm  for  linear  programming  and  on  Gabovv's  algorithm  [11] 
for  edge- coloring  bipartite  graphs.  For  graphs  of  low-to-moderate  density,  tliis  bound  is  better 
than  the  best  previous  bound  mentioned  above. 

The  significance  of  the  bipartite  matching  problem  has  been  weU-recognized  in  the  context  of 
sequential  computation,  combinatorics,  and  graph  theory.  More  recently,  the  importance  of  the 
problem  for  parallel  computation  has  been  recognized  as  well.  The  efficiency  of  several  parallel 
algorithms  depends  on  the  parallel  complexity  of  the  bipartite  matching  problem.  For  example. 
Aggarwal  and  Anderson  [1]  and  Aggarwal,  Anderson,  and  Kao  [2]  show,  respectively,  that  an 
NC  algorithm  for  bipartite  matching  iihplies  NC  algorithms  for  the  problem  of  constructing  a 
depth- first  search  tree  in  undirected  and  directed  graphs. 

The  results  presented  in  this  paper  extend  to  the  maximum- weight  matching  problem  and  to 
the  zero-one  minimum-cost  flow  problem.  The  resulting  algorithms  run  in  0”'(v/mlogC)  time. 
The  previous  best  algorithm  for  the  zero-one  minimum-cost  flow  problem  runs  in  0*((nm)^/'^  log  C) 
time  [13].  The  new  algorithm  is  better  for  both  the  zero-one  maximum  flow  and  the  zero-one 
minimum-cost  flow  problems  for  all  graph  densities. 

'Throughout  the  paper,  n  and  m  denotes  the  number  of  nodes  and  edges  of  the  input  graph.  An  algorithm  runs 
in  0*{f{n))  time  if  it  runs  in  0(/(n)log*(n))  time  for  some  constant  k. 

^Throughout  the  paper  we  assume  that  all  costs  and  weights  are  integers  in  the  range  [— C . . .  C],  where  C  >  1. 
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An  interior-point  algorithm  works  as  follows.  The  algorithm  starts  with  a  point  in  the  interior 
of  the  feasible  region  of  the  linear  program.  In  its  main  loop,  the  algorithm  moves  from  one 
interior  point  to  another,  decreasing  the  value  of  a  potential  function  at  each  iteration.  When 
this  value  is  small  enough,  the  algorithm  terminates  with  an  interior-point  solution  that  has  a 
near-optimal  value.  The  finish-up  stage  of  the  algorithm  converts  this  near-optimal  solution  into 
an  optimal  basic  solution. 

Karmarkar’s  revolutionary  paper  [19]  spurred  the  development  of  the  area  of  interior-point 
linear  programming  algorithms,  and  many  papers  have  followed  his  lead.  Karmarkar’s  algorithm 
runs  in  0{NL)  iterations.^  Gonzaga  [15]  discovered  a  simple  variation  of  Karniarkar’s  algorithm 
that  uses  an  affine  transformation  instead  of  the  projective  transformation  used  by  Karmarkar. 
Renegar  [28]  was  the  first  to  give  an  interior-point  algorithm  that  runs  in  0{\/~N L)  iterations. 
A  different  0(-\/iVi)-iteration  algorithm  was  developed  by  Ye  [33],  which  is  an  improvement  of 
Gonzaga’s  algorithm  (a  similar  algorithm  is  described  in  [10]).  The  matching  algorithm  discussed 
in  this  paper  is  based  on  Ye’s  linear  programming  algorithm.  The  fastest  linear  programming 
algorithm  currently  known  is  due  to  Vaidya  [32].  This  algorithm  is  based  on  Renegar ’s  method 
and  terminates  in  the  same  number  of  iterations,  but  reduces  the  time  per  iteration  using  fast 
matrix  multiplication,  rank-one  updates,  and  careful  balancing.  See  [30]  for  a  survey  of  the 
interior-point  algorithms. 

Interior-point  algorithms  have  proved  to  be  an  important  tool  for  developing  efficient  sequential 
algorithms  for  linear  programming,  its  special  cases,  and  quadratic  programming  (see  e.g.  [18]). 
In  this  paper  we  apply  these  tools  in  the  context  parallel  computation.  For  the  purpose  of  parallel 
computation,  an  important  fact  is  that  the  running  time  of  an  iteration  of  an  interior-point 
algorithm  is  dominated  by  the  time  required  for  matrix  multiplication  and  inversion.  Therefore, 
an  iteration  of  such  an  algorithm  takes  0(log^  A)  time  on  a  PR.4M  using  processors  [26]. 

Roughly  speaking,  every  \fN  iterations  of  an  0{y/N L)  iteration  interior- point  algorithm  de¬ 
crease  the  gap  between  the  current  value  of  the  objective  function  and  the  optimum  value  by  a 
constant  factor.  The  bipartite  matching  problem  can  be  formulated  as  a  linear  program  with  an 
integral  optimum  value.  Therefore,  the  size  of  the  maximum  matching  is  known  as  soon  as  this 
gap  is  below  one.  Furthermore,  the  gap  between  the  value  of  an  initial  solution  and  the  optimal 
value  is  at  most  N.  In  Section  3,  we  give  such  a  formulation  with  N  =  0(m)  and  L  =  O(log  ii). 
This  suggests  that  an  interior-point  algorithm  can  be  used  to  find  the  value  of  the  maximum 
matching  in  a  bipartite  graph  in  0{^/rn\o%n)  iterations,  or  0'{\/m)  time.  In  this  paper  we 
develop  an  algorithm  running  in  this  time  bound  that  finds  a  maximum  matching  as  well  as  its 
value. 

For  this  we  need  to  overcome  two  difficulties.  First,  we  need  to  find  an  initial  interior  point 
with  small  potential  function  value,  so  that  the  number  of  iterations  is  small.  The  second  diffi¬ 
culty  comes  from  the  fact  that  standard  implementations  of  the  finish-up  stage  of  interior-point 
algorithms  either  are  inherently  sequential  or  perturb  the  input  problem  to  simplify  the  finish- 
up  stage,  which  makes  X,  and  therefore  the  number  of  iterations  of  the  main  loop,  superlinear. 


’A  and  L  denote  the  number  of  variables  and  the  size  of  the  linear  program.  See  Section  2  for  formal  definitions. 
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For  the  special  case  of  the  bipartite  matching  problem,  we  give  a  parallel  implementation  of  tlie 
finish-up  stage  that  runs  in  0(log^  n)  time  using  m  processors.  This  implementation  is  based  on 
Gabow’s  edge-coloring  algorithm  [11], 

Our  techniques  apply  to  the  more  general  maximum-weight  matching  problem.  The  algorithm 
and  its  analysis  are  only  slightly  more  involved  in  this  more  general  case,  and  for  brevity,  we  focus 
on  it.  The  results  for  bipartite  matching  are  obtained  as  a  simple  corollary  of  the  results  for 
weighted  bipartite  matching.  The  main  loop  of  our  maximum- weight  matching  algorithm  runs  in 
0*{^/m\o%C)  time,  and  the  finish-up  stage  runs  in  0*(logC)  time.  Therefore,  the  algorithm  runs 
in  0*{y/m\ogC)  time.  A  standard  reduction  between  the  weighted  matching  and  the  zero-one 
minimum-cost  flow  problems  (see  e.g.  [4,  20])  gives  0*[^/m\o^C)  algorithms  for  these  problems. 

This  paper  is  organized  as  follows.  Section  2  introduces  definitions  and  terminology  and 
reviews  Ye’s  linear  programming  algorithm.  Section  3  gives  a  linear  programming  formulation  of 
the  bipartite  matching  problem  that  has  an  initial  interior-point  with  a  small  potential  function 
value,  and  shows  how  to  use  the  linear  programming  algorithm  to  obtain  a  near-optimal  fractional 
matching.  Section  4  describes  a  parallel  procedure  that,  in  0*(log  C)  time,  converts  the  near- 
optimal  fractional  matching  into  an  optimal  zero-one  matching.  Section  -5  contains  concluding 
remarks. 


2  Preliminaries 


In  this  section  we  define  the  matching  problem  and  the  linear  programming  problem,  and  review 
some  fundamental  facts  about  them.  For  a  detailed  treatment,  see  [27,  29].  We  also  give  an 
overview  of  Ye’s  algorithm. 

The  bipartite  matching  problem  is  to  find  a  maximum  cardinality  matching  in  a  bipartite 
graph  G  =  {V,E).  The  maximum-weight  bipartite  matching  problem  is  defined  by  a  bipartite 
graph  G  =  (V,  E)  and  a  weight  function  on  the  edges  w  :  E  — >•  R.  The  weight  of  a  matching  .V/ 
Yle&M  The  problem  is  to  find  a  matching  with  maximum  weight. 

We  use  the  Mowing  notation  and  assumptions.  G  =  {V,E)  denotes  the  (bipartite)  input 
graph,  n  denotes  the  number  of  nodes  in  G,  m  denotes  the  number  of  edges  in  G,  and  C  denotes 
the  maximum  absolute  value  of  the  weights  of  edges  in  G,  which  we  assuihe  to  be  integral.  To 
simplify  the  running  time  bounds,  we  assume,  without  loss  of  generality,  that  m  >  n  —  1  >  I, 
and  C  >  1.  We  denote  the  degree  of  a  node  v  by  d{v),  and  the  set  of  edges  incident  to  node  v  l)y 
5(r;).  For  a  vector  x,  we  let  x{i)  denote  the  fth  coordinate  of  x.  We  use  a  CRCW  PRAM  [9]  as 
our  model  of  parallel  computation. 

It  is  well  known  that  the  node-edge  incidence  matrix  of  a  bipartite  graph  is  totally  unimodular. 
Therefore,  any  optimal  solution  of  the  following  linear  program  is  the  convex  combination  of 
maximum-weight  matchings,  and  hence  the  optimal  value  of  this  linear  program  is  equal  to  the 
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maximum  weight  of  a  matching. 

Matching- 1:  majcimize  w^f  'I 

subject  to:  Ee€fi(v)  /(«)  <  for  each  v  e  V,  i 

/  >  0.  J 

A  feasible  solution  to  the  system  of  above  linear  inequalities  is  called  a  fractional  matching.  We 
denote  an  optimal  solution  of  the  linear  program  by  /*. 

Ye’s  algorithm  handles  linear  programs  in  the  following  form: 

Primal  LP:  minimize  c^x  1 

subject  to:  Ax  =  b,  \ 

X  >  0,  I 

where  .4  is  a  matrix,  and  6,  c  and  x  are  vectors  of  appropriate  dimensions.  We  assume  that  the 
matrix  A  and  the  vectors  b  and  c  are  integral.  We  use  N  to  denote  the  number  of  variables  in  the 
linear  programs  we  consider.  A  vector  x  is  a  feasible  solution  if  it  satisfies  the  constraints  Ax  =  6 
and  X  >  0.  A  feasible  solution  x  is  optimal  if  it  minimizes  the  objective  function  value  c*x,  and  is 
an  interior  point  if  it  is  in  the  interior  of  the  feasible  region,  i.e.,  if  coordinates  of  x  are  positive. 

The  linear  programming  duality  theorem  states  that  the  minimum  value  of  the  Primal  LP  is 
equal  to  the  maximum  value  of  the  following  Dual  LP: 

Dual  LP:  maximize  6‘x 

subject  to:  A*ir  +  s  =  c,  > 

s  >  0. 

— 

where  tt  and  s  are  the  variables  of  the  Dual  LP,  the  dimension  of  tt  is  equal  to  the  dimension  of  b. 
and  the  dimension  of  s  is  equal  to  the  dimension  of  x.  Feasible  and  optimal  solutions  and  interior 
points  for  the  dual  problem  are  defined  in  the  same  way  as  for  the  primal. 

Let  X  be  a  feasible  solution  to  the  Primal  LP,  and  let  (Tr,^)  be  a  feasible  solution  to  the  Dual 
LP.  The  value  c^x  is  an  upper  bound,  and  b^Tt  is  a  lower  bound,  on  the  common  optimal  value  of 
the  two  problems.  Hence  the  difference  c^x  —  b^ir  =  s^x  measures  how  far  the  current  solutions 
are  from  being  optimal.  This  quantity  is  called  the  duality  gap, 

Ye’s  algorithm  is  based  on  algorithms  of  Gonzaga  [15]  and  Todd  and  Ye  [31].  Freund  [10] 
describes  a  very  similar  algorithm,  and  gives  a  detailed  discussion  of  a  good  choice  of  q  (defined 
below).  The  algorithm  is  applied  to  a  pair  of  primal  and  dual  linear  programs  in  the  above  form. 
It  starts  with  a  vector  (xq,  7ro,5o),  where  Xq  and  (tto,  5o)  are  interior  points  of  the  primal  and  dual 
linear  problems,  respectively.  At  each  iteration  of  the  main  loop,  the  algorithm  moves  either  from 
the  current  interior  point  of  the  primal  problem  to  another  interior  point  of  the  problem,  or  from 
the  current  interior  point  of  the  dual  problem  to  another  interior  point  of  the  problem.  Progress 
is  measured  by  a  potential  function 

$(x,  .s)  =  glog(x‘s)  -  log(x(t)5(  j))  -  N  log  N, 

i=l 
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where  q  -  N  +  \fN .  Each  iteration  reduces  this  potential  function  by  a  constant. 

The  number  of  iterations  of  interior-point  algorithms  depends  on  a  parameter  L  that  is  related 
to  the  size  of  the  input  numbers.  This  parameter  is  often  defined  to  be  the  total  number  of  bits 
in  the  binary  description  of  all  coefficients  in  A,  b  and  c.  We  use  a  different  definition  [19.  32], 
which  leads  to  a  much  smaUer  value  of  L  in  the  case  of  the  bipartite  matching  problem.  Let  D{A) 
denote  the  maximum  absolute  value  of  a  subdeterminant  of  A,  and  let  B  denote  the  maximum 
absolute  value  of  the  coefficients  of  b  and  c.  Then  L  is  defined  by 

L  =  log  D(  A)  -b  log  TV  -h  log  B. 

With  this  definition,  L  =  0(log(nC'))  for  linear  program  Matching-1. 

When  the  value  of  the  current  feasible  solution  x  is  less  then  2"^  away  from  the  optimal  value, 
a  standard  (sequential)  rounding  procedure  yields  an  optimal  solution. 

The  following  lemma  is  the  basis  for  the  analysis  of  Ye*s  algorithm. 


Lemma  ^1  [33]  If  we  have  an  initial  solution  (xo,7ro,So)  such  that  $(xo,5o)  <  0{\/NL),  then 
after  OiyNL)  iterations  the  duality  gap  x^s  <  2“^. 


Pcoot.  Recall  t^t  the  algorithm  decreases  the  potential  function  by  a  constant  per  iteration. 

Thu.'i  after  0(\/NL)  iterations  $(x,s)  <  —\/NL.  The  potential  function  can  be  rewritten  as 
follows: 


N 

${x,  s)  =  Vn  log(s‘x)  +  ^  log 

t=l 


s{i)x{i) 


TVlogiV. 


(1) 


Note  that  the  second  term  is  minimized  when  the  values  of  s(i)x(t)  are  the  same  for  all  /.  and 
therefore  this  term  is  at  least  TV  log  TV.  Therefore,  if  the  potential  function  value  is  at  most  -^ML, 
then  \/TVlog(s‘x)  <  —\fNL.  Hence  we  have  x^s  <  2~^.  | 


To  obtain  an  0{\/N  L)  bound  on  the  number  of  iterations,  one  has  to  provide  an  initial  solution 
(.To,  TTo,  .So)  with  #(xo,so)  <  yfNL.  Consider  the  potential  function  $  written  as  in  (1).  It  is  easy 
to  find  an  initial  solution  for  which  the  first  term  is  bounded  by  0(-/NZ).  The  difficulty  is  to 
guaiantee  that  the  second  term  is  fairly  close  to  the  TV  log  TV  lower  bound.  A  good  initial  solution 
is  one  where  the  terms  s(i)x(i)  are  almost  equal.  As  mentioned  in  [.3],  Ye  proposed  a  way  to 
obtain  an  equivalent  formulation  with  such  an  initial  solution.  This  uses  the  usual  definition  of 
X,  but  can  also  be  shown  to  work  for  the  definition  of  L  that  we  use  in  this  paper.  In  the  next 
section  we  provide  a  slightly  simplified  construction  for  the  bipartite  matching  problem. 


3  Finding  a  Near-Optimal  Solution 

In  this  section  we  show  how  to  convert  the  Matching- 1  linear  program  into  a  linear  program  that 
IS  in  the  form  required  by  Ye’s  algorithm  and  has  an  initial  solution  with  small  potential  function 


•o 


value.  Then  we  show  how  to  compute  a  near-optimal  fractional  matching  from  this  initial  solution. 
We  restate  the  matching  problem  as  follows; 


Matching-2:  minimize  —w^f  + 

subject  to:  ^  /(e)  -1-  (n  —  d(v))g(v)  —  z  =  1  for  each  v  6  V', 


e65(<-) 


l^f  +  l^g  +  y  =  n  +  m  +  l, 

f,9,2,y  >  0, 


(*) 


(2) 


where  1  denotes  the  vector  all  of  whose  coordinates  are  1.  We  denote  the  objective  function  of  this 
linear  program  by  c.  The  number  of  variables  in  this  linear  program  is  m-l-n-t-2  =  iV.  We  denote 
a  feasible  solution  to  Matching-2  by  a;  =  {  f,g,y,z),  and  a  feasible  solution  of  the  corresponding 
dual  problem  by  tt  and  s,  where  7r(0  for  i  =  is  the  dual  variable  corresponding  to  the 

primal  constraint  for  node  u,-,  and  7r(u  -h  1)  is  the  dual  variable  corresponding  to  the  constraint 
(*).  Note  that  for  this  linear  program,  L  =  0(log(nC)). 

Intuitively,  the  transformation  works  as  follows.  Variables  g{v)  are  the  slack  variables  intro¬ 
duced  to  replace  inequality  constraints  by  equality  constraints.  The  positive  multipliers  (n-d(u)) 
scale  the  slack  variables  so  that  there  is  a  feasible  solution  with  all  original  and  slack  variables 
equal.  The  coefficient  of  r  in  the  objective  function  is  large  enough  to  guarantee  that  r  =  0  in 
an  optimal  solution.  The  constraint  (*)  does  not  affect  the  primal  problem  since  y  is  not  in  the 
objective  function  and,  as  we  have  just  mentioned,  in  an  optimal  solution  z  =  0  and  therefore 
g*l  +  fl  <  n  is  automatically  satisfied.  This  constraint,  however,  allows  us  to  obtain  an  initial 
solution  for  the  dual  problem  such  that  the  dual  slack  variables  corresponding  to  the  primal  vari¬ 
ables  /,  g  and  y  are  roughly  equal.  A  natural  dual  solution  is  to  set  the  vector  tt  to  0,  and  set 
the  dual  slack  variables  equal  to  the  primal  objective  coefficients.  Even  if  this  were  feasible,  the 
coefficients  are  very  different  (the  coefficients  of  g  are  zero,  the  others  are  not),  and  so  this  is  not 
a  good  starting  solution.  However,  by  setting  the  coordinate  of  tt  corresponding  to  the  additional 
constraint  to  be  a  large  negative  number  (while  keeping  the  others  equal  to  0)  the  slacks  (except 
for  the  one  corresponding  to  z)  are  made  feasible,  and  roughly  equal  to  the  dual  variable  for  (*). 
The  variable  .r  is  introduced  to  make  it  possible  to  have  a  starting  primal  solution  with  coordinates 
of  /,  g  and  y  equal  (for  example,  to  1).  We  choose  the  dual  variable  corresponding  to  (+)  so  that 
for  the  initial  primal  solution  mentioned  above,  the  x{i)s{i)  terms  are  all  roughly  equal.  As  we 
shall  see,  this  results  in  a  small  initial  value  of  the  second  term  of  $  (written  as  in  (I)). 

We  define  initial  primal  and  dual  solutions  «is  suggested  by  the  above  discussion.  The  initial 
primal  solution  xq  is  defined  by 


f  =  l,g=  l,y  =  1,  z  =  n  -  1. 

The  initial  dual  solution  (xq,  so)  is  defined  by 

n{i)  =  0,  for  1  <  f  <  n, 

7r(?r+  1)  =  -N'^C, 

s{i)  =  c{i)  +  NK:a(i)  for  1  <  i  <  A, 
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where  a(i)  is  the  t-th  coefficient  of  the  equation  (*)  in  the  definition  of  the  Matching-2  LP. 
The  following  two  lemmas  formalize  the  above  intuition. 


Lemma  3.1  If  {f,g,y,z)  is  an  optimal  solution  of  Matching-2,  then  /  is  an  optimal  solution  to 
Matching-1. 


Proof  :  It  suffices  to  show  that  every  optimal  solution  to  Matching-2  has  ^  =  0.  Consider  a  feasible 
point  a,’!  =  with  z\  ^  0.  Since  f\  satisfies  Y^e^S(v)  hi^)  <  1  +  for  every  node 

V,  decreasing  fi  on  some  edges,  by  a  total  of  at  most  zin,  converts  fi  into  a  vector  /2  that  is  a 
fractional  matching.  Note  that  any  fractional  matching  /  can  be  extended  to  a  feeisible  solution  of 
Matching-2.  Let  X2  denote  a  feasible  solution  extending  /j.  If  we  replace  .ti  by  X2,  the  decrease  in 
the  objective  function  value  caused  by  the  reduction  in  2  is  >  ZiNC.  The  increase  due  to 

the  change  in  /  is  bounded  by  zinC  <  Z\NC .  Therefore,  the  value  c^X2  is  smaller,  which  implies 
that  any  optimal  solution  must  have  2  =  0.  | 


Lemma  3.2  The  vectors  Xq  and  (5ro,So)  are  interior-point  solutions  of  the  primal  and  the  dual 
problems,  respectively.  The  value  of  the  potential  function  $(.to,  so)  is  at  most  OfViV  log(nC)). 


Proof:  The  first  claim  of  the  lemma  is  easy  to  verify.  To  verify  the  second  claim,  consider  the 
potential  function  written  as  in  (1).  The  first  term  is  at  most  0(\/iV  log(nC)).  VVe  show  that  the 
second  term  is  at  most  iV  log  AT -1-0(1).  First  we  show  that  for  every  i,  .s*x/{s{i)x{i))  <  N  +  0(1). 
Recall  that  N  =  n  +  m  +  2  and  note  that 

50*2^0  =  nN^C  -h  mN^C  -  w^l  -|-  2N^C. 

Consider  each  type  of  variable  separately. 


For  variables  .s(<)  and  x{i)  corresponding  to  2,  y,  and  g,  we  get 


sixo/(so{i)xo{i))  =  ra  -I-  m  -I-  2  -  <  -V  -t-  0(1). 

iV  ^0 

For  variables  s(i)  and  x-(f)  corresponding  to  /,  we  get 

4xo/(.o(.>„(j))  =  n  +  m  +  2  -  -‘1+ "-KO  + -...W  +  2^.)  „  ^  ^d). 

N^C  —  w{z) 


Since  log (1  +  h)  <  /z  for  /i  >  —  1,  the  above  calculations  imply  that 

^  t 


t=l 


<  N  \og{N  -b  0(1))  <  N  log  N  +  0(1). 
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Now  we  are  ready  to  give  the  O"‘(v^log  C)-time  algorithm  to  compute  the  weight  of  an 
optimal  matching  and  to  find  a  near-optimal  fractional  matching.  In  the  next  section  we  show 
how  to  find  an  optimal  matching. 

The  following  lemma  is  based  on  the  fact  that  the  objective  coefficient  of  z  has  been  chosen 
large  enough  to  ensure  that  any  near-optimal  solution  to  Matching-2  can  be  rounded  to  a  nearby 
feasible  solution  to  Matching- L 


Lemma  3.3  A  fractional  bipartite  matching  with  weight  at  most  1/2  less  than  the  weight  of  an 
optimal  matching  can  be  computed  in  0*[y/m\ogC)  time  on  a  PRAM  with  processors. 

Proof:  Lemmas  2.1  and  3.2  imply  that,  after  0(\/iV  log(nC))  =  0{y/m\og{nC))  iterations  of  the 
LP  algorithm,  we  obtain  a  point  (a;,x,^)  with  duality  gap  x^s  <  1/4.  Hence  we  have 

1 

+  (3) 

where  /*  is  an  optimal  solution  to  Matching- 1.  Since  s  >  0,  this  implies  that  w^f“  -  w^f  <  1/4. 
As  in  Lemma  3.1,  we  can  argue  that  /  can  be  converted  to  a  feasible  solution  of  the  Matching- 
1  problem  by  decreasing  its  value  on  some  of  tlie  edges  by  a  total  of  at  most  zn.  Therefore, 
w^f*  >  w^f  —  znC.  From  (3),  this  implies  that  <  1/4  +  znC.  Thus, 

n  -  I  1 

^  ~  4C(iV2  -  n)  ^  47nC'’ 

Now  round  all  values  of  /  and  g  down  to  have  a  common  denominator  4mC,  and  denote  the 
rounded  solution  by  f\,gi.  Clearly,  iv^f*  -  w^fi  <  l/4  +  (mC')/(4/7zC)  <  1/2.  After  the  rounding, 
we  have: 

A(e)  +  («  -  d{v))gi{v)  <  1  +  5 

e€5(t^) 

The  left-hand  side  is  an  integer  multiple  of  (4C'7?r  )“^  and  2  <  (4C7n)~^  This  implies  that 

e£S(v) 

Hence,  the  resulting  vector  fi  is  a  fractional  matching  whose  weight  is  within  1/2  of  the  optimum. 

I 

Corollary  3.4  A  fractional  bipartite  matching  with  cardinality  at  most  1/2  less  than  that  of  the  max¬ 
imum  cardinality  matching  can  be  computed  in  0*{\/m\ogC)  time  on  a  PRAM  with  processors. 
The  cardinality  of  the  maximum  matching  can  be  computed  within  the  same  bounds. 
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4  The  Finish-Up  Stage 


In  the  previous  section  we  have  shown  how  to  compute,  in  0*(  v/mlog  C)  time,  a  fractional  bipar¬ 
tite  matching  with  weight  at  most  1/2  less  than  the  optimum.  In  this  section  we  give  an  O-ilogT') 
algorithm  for  converting  any  such  fractional  matching  into  a  maximum-weight  matching.  Note 
that  for  the  unweighted  bipartite  matching,  this  algorithm  runs  in  polylogarithmic  time. 


Let  /  be  a  fractional  bipartite  matching  which  has  weight  at  most  1/2  less  than  the  maximum 
weight.  First  we  construct  a  fractional  matching  /',  such  that  the  values  of  /'  have  a  relatively 
small  common  denominator  that  is  a  power  of  two  and  the  weight  of  /'  differs  from  the  maximum 
weight  by  less  than  1.  Define  A  by 

A  = 


By  definition,  A  is  an  integer  power  of  2  and  A  =  0{mC).  Let  f  be  the  fractional  matching 
obtained  by  rounding  /  down  to  the  nearest  multiple  of  1/A.  Note  that 


IwV  -  u;7'l  < 


mC 


mC 

2riogmCl-H 


Therefore  m*/*  —  <  1. 


Next  we  show  how  to  construct  from  /'  a  multi-graph  that  will  allow  us  to  fuid  /’.  Consider 
a  multi-graph  G'  =  {V,E')  with  the  edge  set  containing  A  •  /'(e)  copies  of  e  for  every  e  e  E. 
and  no  other  edges.  The  following  lemma  shows  a  relationship  betweeng  this  multigraph  and 
maximum-weight  matchings  of  G. 


Lemma  4.1  For  any  coloring  of  the  edges  of  G'  with  A  colors,  there  exists  a  color  class  which  is  a 
maximum-weight  matching  of  C. 


Proof  :  The  proof  is  by  a  simple  counting  argument.  The  sum  of  the  weights  of  the  color  classes 
is  equal  to  Aw^f'  >  A(w*f*  —  1).  Since  there  are  A  color  classes,  at  least  one  of  them  has  woiglit 
above  in'/*  -  1-  The  claim  follows  from  the  integrality  of  w.  | 

The  above  lemma  implies  that,  in  order  to  find  a  maximum  weight  matching,  it  is  sufficient  to 
edge-color  G'  using  A  colors.  Since  G'  is  a  bipartite  graph  and  its  maximum  degree  is  bounded 
by  A,  which  is  a  power  of  2,  we  can  use  a  parallel  implementation  of  Gabow’s  algorithm  [LI]  to 
edge-color  G'  using  A  colors.  However,  G'  has  0{mC)  edges  and  therefore  the  algorithm  uses 
fl{mC)  processors.  In  order  to  reduce  the  processor  requirement,  we  use  a  somewhat  different 
algorithm.  The  algorithm  does  not  use  an  explicit  representation  of  the  multigraph,  but  rather 
uses  a  weighted  representation  of  a  simple,  graph.  A  divide-and-conquer  approach  is  then  used  to 
split  the  (implicit)  multigraph  so  that  the  bound  on  the  maximum  weight  of  an  edge  is  halved, 
and  then  recurses  on  the  part  with  greater  weight.  A  subroutine  to  find  such  a  partitioning  is 
also  the  basis  of  Gabow’s  edge-coloring  algorithm. 

Figure  1  describes  the  algorithm  to  find  a  maximum- weight  matching  given  a  near-optimal 
fractional  matching.  The  algorithm  starts  by  rounding  the  fractional  matching  to  a  small  common 
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procedure  Round{E,  f)-, 

A  2n°8"‘^1  +  l  ; 

f  ^  f  rounded  down  to  a  common  denominator  of  A; 
d'  <-  A; 

while  d'  >  1  do  begin 

EQ^{e\eeE,d'  -fie)  is  odd}; 

(£'1,^2)  ♦—  Degree-Split(V,  Eo)', 
luiEi)-, 

W2  *—  w(E2); 
if  Wi  >  W2 
then  begin 

fore€£i  do  /'(e)  ^ /'(e)  +  I/d'; 
for  e  €  £2  do  /'(e)-<—  /'(e)  -  I/d'; 
end; 

else  begin 

for  e  €  £2  do  /'(e)  ^  /'(e)  +  I/d'; 
for  e  €  £1  do  /'(e)  ^  /'(e)  -  I/d'; 
end; 
d'  ^  d'/2; 
end; 

return  ({e  |  /'(e)  =  1}) 

end. 


Figure  1:  Rounding  an  approximate  fractional  matching  to  an  optimal  integral  one 

denominator  as  described  above.  Then  it  computes  from  the  fractional  matching  /'  with  common 
denominator  A,  two  fractional  matchings  /i  and  /2  such  that  /'  =  j(/i  +  /2)  and  both  /i  and  /2 
have  common  denominator  A/2.  This  is  accomplished  with  the  help  of  the  procedure  Degree-split 
that  partitions  the  edges  of  a  bipartite  graph  Go  =  {V,Eo)  into  two  classes  Ei  and  E2,  so  that 
for  every  node  v,  the  degree  of  v  in  the  two  induced  subgraphs  differs  by  at  most  one.  Then 
/'  is  replaced  by  fi  or  /2  depending  on  which  one  has  larger  weight.  This  process  is  iterated 
G(log(nC))  times,  until  the  current  fractional  matching  is  integral.  This  matching  has  an  integral 
weight  that  is  more  than  —  1,  and  therefore  the  matching  is  optimal. 


Lemma  4.2  The  algorithm  Round  produces  a  maximum-weight  matching. 


Proof:  Consider  the  parameter  d'  used  in  the  algorithm  in  Figure  1.  Initially  d’  =  A.  Note  that 
after  iteration  i  we  have  d'  =  A/2'.  We  show  by  induction  that  after  iteration  i: 

•  /'  is  a  fractional  matching, 

•  w*'f'  >  w^f*  —  1, 

•  coordinates  of  /'  have  common  denominator  d' . 
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procedure  Degree-SplHiV,  E)\ 

Construct  a  new  node  set  by  replacing  each  node  v  G  K  by  an  independent  set  of  size  |‘d(r)/2]: 
For  each  node  in  7,  assign  its  incident  edges  to  nodes  in  7',  so  that  each  node  v  in  V'  has  d{v)  <2\ 
Edge-color  the  resulting  graph  using  two  colors; 

Return  the  edges  of  each  color  class; 

end. 


Figure  2:  Splitting  the  maximum  degree  of  the  graph 

Initially  all  three  conditions  are  satisfied.  Assuming  that  ail  three  conditions  are  satisfied  after 
iteration  i  —  1,  we  prove  that  they  remain  satisfied  after  iteration  i.  Let  di  and  fi  denote  d'  and 
/'  before  iteration  i  and  let  ^2  and  /2  denote  d'  and  /'  after  iteration  L  The  last  claim  follows 
from  the  fact  that  the  coordinates  of  fi  that  are  odd  multiples  of  1/di  are  adjusted  by  1/di  in 
this  iteration,  and  so  all  coordinates  of  /2  are  even  multiples  of  1/di,  and  hence  multiples  of 
l/d2.  The  second  claim  follows  from  the  fact  that  the  components  of  /2  that  have  been  increased 
correspond  to  edges  of  greater  total  weight  than  those  that  have  been  decreased.  Now  consider 
the  first  claim.  By  the  inductive  assumption,  X2e€c5(v)  /i(^)  ^  definition  of  Procedure 

Degree-split,  /2(e)  <  J2ees(v)  /i(^)  +  ^  1  +  l/^i*  However,  we  have  seen  already  that 

/2  has  a  common  denominator  of  d2.  Hence,  integer  multiple  of  l/d2  =  2/di 

and  therefore  at  most  one. 

After  log  A  iterations  we  construct  an  /'  that  is  integral  and  whose  weight  is  above  —  1. 
By  the  integrality  of  w,  the  set  of  edges  where  this  f  is  1  is  the  desired  maximum-weight  matching 
of  the  input  graph.  | 

The  Degree-Split  procedure  is  described  in  Figure  2.  The  following  two  lemmas  imply  the 
desired  time  bound. 

Lemma  4.3  The  procedure  Degree-Split  partitions  the  input  graph  into  two  graphs  with  disjoint 
edge-sets,  such  that  the  degrees  of  any  node  v  in  the  two  graphs  differ  by  at  most  one.  The  procedure 
runs  in  O(logn)  time. 

Proof:  Observe  that  the  graph  constructed  on  7'  is  bipartite,  and  the  degree  of  a  node  is  at  most 
two.  Therefore  the  graph  consists  of  paths  and  even  cycles.  Hence  it  can  be  two  edge-colored  in 
0(log7i)  time  using  m  processors  [21,  22].  The  claim  of  the  lemma  follows  from  the  fact  that  each 
node  €  7  is  an  end  point  of  at  most  one  path.  | 


Lemma  4.4  The  algorithm  Round  runs  in  O(log nlog nC)  time  using  m  processors. 


Proof:  The  number  of  iterations  of  the  loop  of  the  algorithm  is  0(log  A)  =  O(lognC),  because 
d  is  halved  at  each  iteration.  The  running  time  of  each  iteration  is  dominated  by  Degree-Split. 
which  takes  0{logn)  time  by  Lemma  4.3.  | 
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Corollary  4.5  On  unweighted  bipartite  matching  problem,  the  algorithm  Round  runs  in  O(log'  /;) 
time  using  m  processors. 


Theorem  4.6  A  maximum-weight  bipartite  matching  can  be  computed  in  0*(>/mlogC)  time  using 
processors. 


Proof:  Immediate  from  Lemmas  3.3  and  4.4.  | 


Corollary  4.7  A  maximum  cardinality  bipartite  matching  can  be  computed  in  time  using 

processors. 


5  Conclusions 


Interior-point  methods  have  proved  to  be  very  powerful  in  the  context  of  sequential  computation. 
In  this  paper  we  show  how  to  apply  these  methods  to  the  design  of  parallel  algorithms.  We  believe 
that  these  methods  will  find  more  applications  in  the  context  of  parallel  computation,  and  would 
like  to  mention  the  following  two  research  directions. 

One  direction  is  to  attempt  to  generalize  our  result  to  general  linear  programming,  showing 
that  any  linear  programming  problem  can  be  solved  in  0*{y/NL)  time.  This  would  require  a 
parallel  implementation  of  the  finish-up  stage  of  the  algorithm  that  runs  in  0"[\/N L)  time.  .4 
related  question  is  whether  the  problem  of  finding  a  vertex  of  a  polytope  with  objective  function 
value  smaller  than  that  of  a  given  interior  point  of  the  polytope  is  P-complete. 

The  other  direction  of  research  is  to  attempt  to  use  the  special  structure  of  the  bipartite  match¬ 
ing  problem  to  obtain  an  interior- point  algorithm  for  this  problem  that  finds  an  almost-optimal 
fractional  solution  in  less  that  0*{^/m)  time;  an  bound  would  be  especially  interesting, 

since  in  combination  with  results  of  Section  4  it  would  imply  that  bipartite  matching  is  in  NC. 


References 

[1]  A.  Aggarwal  and  R.  J.  Anderson.  A  Random  NC  Algorithm  for  Depth  First  Search.  In  Proc. 
19th  ACM  Symposium  on  Theory  of  Computing,  pages  325-334,  1987. 

[2]  A.  Aggarwal,  R.  J.  Anderson,  and  M.  Y.  Kao.  Parallel  Depth-First  Search  in  General  Directed 
Graphs.  In  Proc.  21st  ACM  Symposium  on  Theory  of  Computing,  page  (to  appear),  1989. 

[3]  K.  M.  Anstriecher  and  R.  A.  Bosch.  Long  Steps  in  an  0{n^L)  Algorithm  for  linear  program¬ 
ming.  Unpublished  manuscript,  Yale  School  of  Management,  Yale  University,  1989. 


12 


[4]  A.  K.  Chandra,  L.  Stockmeyer,  and  U.  Vishkin.  Constant  depth  reducibility.  SIAM  J. 
Comput.,  13(2):423-439,  May  1984. 

[o]  R.  Cole  and  J.  Hopcroft.  On  Edge  Coloring  Bipartite  Graphs.  SIAM  J.  Coinput.,  11:540-546. 
1992. 

[6]  E.  A.  Dinic.  Algorithm  for  Solution  of  a  Problem  of  Maximum  Flow  in  Networks  with  Power 
Estimation.  Soviet  Math.  Dokl.,  11:1277-1280,  1970. 

[7]  S.  Even  and  R.  E.  Tarjan.  Network  Flow  and  Testing  Graph  Connectivity.  SLAM  J.  Comput., 
4:507-518,  1975. 

[8]  L.  R.  Ford,  Jr.  and  D.  R.  Fulkerson.  Flows  in  Networks.  Princeton  Univ.  Press,  Princeton. 
NJ.,  1962. 

[9]  S.  Fortune  and  J.  WyUie.  Parallelism  in  Random  Access  Machines.  In  Proc.  10th  ACM 
Symp.  on  Theory  of  Computing,  pages  114-118,  1978. 

[10]  R.  M.  Freund.  Polynomial-Time  Algorithms  for  Linear  Programming  Based  only  on  Primal 
Scaling  and  Projective  Gradients  of  a  Potential  Function.  Technical  Report  OR- 182-88, 
Operations  Research  Center,  M.I.T.,  1988. 

[11]  H.  N.  Gabow.  Using  Euler  Partitions  to  Edge-Color  Bipartite  Multi-graphs.  Int.  J.  Comput. 
Inform.  ScL,  5:345-355,  1976. 

[12]  H.  N.  Gabow  and  0.  Kariv.  Algorithms  for  Edge-Coloring  Bipartite  Graphs  and  Multigraphs. 
SIAM  .J.  Comput.,  11:117-129,  1982. 

[13]  A.  V.  Goldberg,  S.  A.  Plotkin,  and  P.  M.  Vaidya.  Sublinear-Time  Parallel  Algorithms  for 
Matching  and  Related  Problems.  In  Proc.  29th  IEEE  Symp.  on  Found,  of  Comp.  Sci.,  pages 
174-185,  1988. 

[14]  .4.  V.  Goldberg  and  R.  E.  Tarjan.  A  New  Approach  to  the  Maximum  Flow  Problem.  J. 
A.9SOC.  Comput.  Mach.,  35:921-940,  1988. 

[15]  C.  C.  Gonzaga.  An  Algorithm  for  Solving  Linear  Programming  in  0{n^L)  Operations.  In 
N.  Megiddo,  editor.  Progress  in  Mathematical  Programming,  pages  1-28.  Springer  Verlag, 
Berlin,  1989. 

[16]  D.  \ .  Grigoriev  and  M.  Karpinski.  The  Matching  Problem  for  Bipartite  Graphs  with  Polyno- 
miaUy  Bounded  Permanents  is  in  NC.  In  Proc.  28th  IEEE  Symp.  on  Foundations  of  Comp. 
Sci.,  pages  166-172,  1987. 

[17]  J.  E.  Hopcroft  and  R.  M.  Karp.  An  Algorithm  for  Maximum  Matching  in  Bipartite 
Graphs.  SIAM  J.  Comput.,  2:225-231,  1973. 

[18]  S.  Kapoor  and  P.  M.  Vaidya.  Fast  Algorithms  for  Conve.x  Quadratic  Programming  and 
Multicommodity  Flows.  In  Proc.  18th  ACM  Symp.  on  Theory  of  Computing,  pages  147-159, 
1986. 


13 


[19]  N.  Karmarkar.  A  New  Polyiiomial-Time  Algorithm  for  Linear  Programming.  Combinatorico. 
4:373-395,  1984. 

[20]  R.  M.  Karp,  E.  Upfal,  and  A.  VVigderson.  Constructing  a  Maximum  Matching  is  in  Random 
NC.  Combinatorica,  6:35-48,  1986. 

[21]  R.  E.  Ladner  and  M.  J.  Fischer.  Parallel  prefix  computation.  J.  Assoc.  Comp.  .Mach.. 
27:831-838,  1980. 

[22]  C.  Leiserson  and  B.  Maggs.  Communication-efficient  parallel  graph  algorithms.  In  Proc.  of 
International  Conference  on  Parallel  Processing,  pages  861-868,  1986. 

[23]  G.  F.  Lev,  N.  Pippenger,  and  L.  G.  Valiant.  A  Fast  Parallel  Algorithm  for  Routing  in 
Permutation  Networks.  IEEE  Trans,  on  Comput,  C-30:93-100,  1981. 

[24]  G.  L.  Miller  and  J.  Naor.  Flow  in  Planar  Graphs  with  Multiple  Sources  and  Sinks.  Un¬ 
published  Manuscript,  Computer  Science  Department,  Stanford  University,  Stanford,  CA, 
1989. 

[25]  K.  Mulmuley,  U.  V.  Vazirani,  and  V.  V.  Vazirani.  Matching  is  as  easy  as  matrix  inversion. 
Combinatorica,  pages  105-131,  1987. 

[26]  V.  Pan  and  J.  Reif.  Efficient  Parallel  Solution  of  Linear  Systems.  In  Proc.  17th  .ACM 
Symposium  on  Theory  of  Computing,  pages  143-152,  1985. 

[27]  C.  H.  Papadimitriou  and  K.  Steiglitz.  Combinatorial  Optimization:  Algorithms  and  Com¬ 
plexity.  Prentice- HaU,  Englewood  Cliffs,  NJ,  1982. 

[28]  J.  Renegar.  A  Polynomial  Time  Algorithm,  Based  on  Newton’s  Method,  for  Linear  Program¬ 
ming.  Mathematical  Programming,  40:59-94,  1988. 

[29]  A.  Schrijver.  Theory  of  Linear  and  Integer  Programming.  J.  Wiley  &  Sons,  1986. 

[30]  M.  J.  Todd.  Recent  Developments  and  New  Directions  in  Linear  Programming.  Technical 
Report  829,  School  of  Operations  Research  and  Industrial  Engineering,  Cornell  University. 

1988. 

[31]  M.  J.  Todd  and  Y.  Ye.  A  Centered  Projective  Algorithm  for  Linear  Programming.  Technical 
Report  763,  School  of  Operations  Research  and  industrial  Engineering,  Cornell  Universitv. 

1989. 

[32]  P.  M.  Vaidya.  Speeding  up  Linear  Programming  Using  Fast  Matrix  Multiplication.  Technical 
Memorandum,  AT&T  Bell  Laboratories,  Murray  Hill,  NJ,  1989. 

[33]  Y.  Ye.  An  0{n^L)  Potential  Reduction  Algorithm  for  Linear  Programming.  Unpublished 
manuscript.  The  University  of  Iowa,  1989. 


14 


